New approaches to measuring the performance of programs that generate differential diagnoses using ROC curves and other metrics
نویسندگان
چکیده
INTRODUCTION Evaluation of computer programs which generate multiple diagnoses can be hampered by a lack of effective, well recognized performance metrics. We have developed a method to calculate mean sensitivity and specificity for multiple diagnoses and generate ROC curves. METHODS Data came from a clinical evaluation of the Heart Disease Program (HDP). Sensitivity, specificity, positive and negative predictive value (PPV, NPV) were calculated for each diagnosis type in the study. A weighted mean of overall sensitivity and specificity was derived and used to create an ROC curve. Alternative metrics Comprehensiveness and Relevance were calculated for each case and compared to the other measures. RESULTS Weighted mean sensitivity closely matched Comprehensiveness and mean PPV matched Relevance. Plotting the Physician's sensitivity and specificity on the ROC curve showed that their discrimination was similar to the HDP but sensitivity was significantly lower. CONCLUSIONS These metrics give a clear picture of a program's diagnostic performance and allow straightforward comparison between different programs and different studies.
منابع مشابه
Assessment of Structure-Specific Fragility Curves for Soft Storey Buildings Implementing IDA and SPO Approaches
Soft storey building is popular due to the functional and aesthetic purpose, despite its weakness in resisting seismic excitation. Nonlinear Static (Pushover) Analysis (POA) is a time saving and simple assessment procedure prosposed in Eurocode 8 (EC8). However, its reliability in designing structure still remains a question. At the first stage, seismic performance of several building models us...
متن کاملReview of ranked-based and unranked-based metrics for determining the effectiveness of search engines
Purpose: Traditionally, there have many metrics for evaluating the search engine, nevertheless various researchers’ proposed new metrics in recent years. Aware of this new metrics is essential to conduct research on evaluation of the search engine field. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating the search engines. Methodology: This is ...
متن کاملDifferential genes expression analysis of invasive aspergillosis: a bioinformatics study based on mRNA/microRNA
Invasive aspergillosis is a severe opportunistic infection with high mortality in immunocompromised patients. Recently, the roles of microRNAs have been taken into consideration in the immune system and inflammatory responses. Using bioinformatics approaches, we aimed to study the microRNAs related to invasive aspergillosis to understand the molecular pathways involved in the disease pathogenes...
متن کاملCompareDx: a Software Toolkit for Measuring the Performance of Programs that Generate Multiple Diagnoses
Introduction Evaluations of medical diagnosis programs have been carried out for several decades but for programs which produce multiple diagnoses there is a lack of suitable, well validated performance metrics. If a program reasons about only one (or a few) types of diagnosis, then the sensitivity and specificity of the program can readily be determined given a suitable standard diagnosis. How...
متن کاملA CROC stronger than ROC: measuring, visualizing and optimizing early retrieval
MOTIVATION The performance of classifiers is often assessed using Receiver Operating Characteristic ROC [or (AC) accumulation curve or enrichment curve] curves and the corresponding areas under the curves (AUCs). However, in many fundamental problems ranging from information retrieval to drug discovery, only the very top of the ranked list of predictions is of any interest and ROCs and AUCs are...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Proceedings. AMIA Symposium
دوره شماره
صفحات -
تاریخ انتشار 2000